Detection of hypermutable nucleic acid sequence in tissue

ABSTRACT

An assay for detection of a mammalian cell proliferative disorder associated with a hypermutable nucleic acid sequences is provided. The identification of particular hypermutable sequences such as microsatellite loci correlates with a particular cancer, thereby allowing detection of both primary tumors and metastatic sites within a patient.

This is a continuation of application Ser. No. 08,299,477, filed Aug.31, 1994, now abandoned.

BACKGROUND OF THE INVENTION

1. Field of the Invention

The present invention relates generally to detection of a target nucleicacid sequence and specifically to detection of a cell proliferativedisorder associated with a hypermutable nucleic acid sequence in asample.

2. Description of Related Art

Mammalian genomes consist of unique DNA sequences interspersed withmoderately and highly repetitive DNA sequences. Gene mapping by meioticlinkage analysis has traditionally been carried out using variations inunique sequence DNA, such as restriction fragment length polymorphisms(Botstein, et al., Am. J. Hum. Genet., 32:314-331, 1980), as geneticmarkers. Recently, variations in the repetitive sequence elements suchas minisatellite or variable number tandem repeat (VNTR) sequences(Jeffreys, et al., Nature, 314:67-73, 1985; Nakamura, et al, Science,235:1616-1622, 1987), and microsatellite or variable simple sequencemotifs (VSSM) (Litt and Luty, Am. J. Hum. Genet., 44:397-401, 1989;Weber and May, Am. J. Hum. Genet., 44:388-396, 1989) have been found tobe useful for linkage studies. One advantage to the use of repetitivesequence variations rather than unique sequence variations is theapparently greater number of alleles present in normal populations whencompared to restriction fragment length polymorphisms (RFLPs). A secondadvantage is the ability to readily detect sequence length variationsusing the polymerase chain reaction to facilitate the rapid andinexpensive analysis of large numbers of DNA samples.

Microsatellite elements consist of simple mono-, di-, or tri- nucleotidesequences where alleles differ by one or more repeat units (Luty, etal., Am.J. Hum. Genet., 46:776-783, 1990; Tautz, et al., Nature,322:652-656,1986; Weber and May, Am. J. Hum. Genet., 44:388-396, 1989).Minisatellites, or VNTR sequences, typically have a repeat unit of 20 toseveral hundred nucleotides and alleles differ by as little as onerepeat unit. Among simple sequences, the (TG)_(n) or (CA)_(n) repeatelements have recently proven extremely useful for meiotic mapping since(1) they are abundant in the genome, (2) display a large number ofdifferent alleles, and (3) can be rapidly assayed using the polymerasechain reaction (Litt and Luty, Am. J. Hum. Genet., 44:397-401,1989;Weber and May, Am. J. Hum. Genet., 44:388-396, 1989).

A number of other short sequence motifs have been found in mammaliangenomes (Hellman, et al., Gene, 68:93-100, 1988; Knott, et al., Nuc.Acids Res., 14:9215-9216,1986; Litt and Luty, Am. J. Hum. Genet.,44:397-401,1989; Milstein, et al., Nuc. Acids Res., 12:6523-6535, 1984;Stoker, et al., Nuc. Acids Res., 13:4613-4621, 1985; Vassart, et al.,Science, 233:683-684, 1987; and Vergnaud, Nuc. Acids Res., 17:7623-7630,1989), and avian genomes (Gyllensten, et al, Nuc. Acids Res.,17:2203-2214, 1989; Longmire, et al., Genomics, 2:14-24, 1988) and arethought to accumulate by DNA slippage during replication (Tautz, et al.,Nature, 322:652-656, 1986) or unequal recombination events (Wolff, etal., Genomics, 5:382-384,1989). Many of these repeat elements display ahigh degree of genetic variation and, thus, are also useful for meioticand mitotic mapping.

The VNTR sequence isolated by Jeffreys, (supra) contains an invariantcore sequence GGGCAGGAXG which bears some similarities to the chisequence of phage lambda (Wolff, et al., Genomics, 5:382-384, 1989) andis detected by a restriction fragment of bacteriophage M13 (Vassart, etal., Science, 233:683-684, 1987). Similar repeat elements have beendetected by Nakamura, et al. (Science, 235:1616-1622, 1987) and containa similar, but distinctive, common core unit GGG--GTGGGG. Elements ofthis type occur within several known gene sequences including the βglobin locus. Similar VNTR elements have been described within theapolipoprotein B (Boerwinkle, et al., Proc. Natl. Acad. Sci. USA,86:212-216,1989; Knott, et al., Nuc. Acids Res., 14:9215-9216, 1986) andcollagen type II genes (Stoker, et al., Nuc. Acids Res., 13:4613-4621,1985) and contain a distinct AT-rich motif. Though a physiologicalfunction for repetitive elements of this type has not been defined, theyhave been suggested as potential hot spots for chromosome recombination(DeBustros, et al., Proc. Natl. Acad. Sci. USA, 85:5693-5697,1988) orelements important for the control of gene expression (Hellman, et al.,Gene, 68:93-100, 1988; Milstein, et al., Nuc. Acids Res., 12:6523-6535,1984).

Microsatellites represent a very common and highly polymorphic class ofgenetic elements in the human genome. Microsatellite markers containingrepeat sequences have been used for primary gene mapping and linkageanalysis as described (Weber, et al., Am. J. Human Genet., 44:388,1989). PCR amplification of these repeats allows rapid assessment forloss of heterozygosity (LOH) and can greatly simplify procedures formapping tumor suppressor genes (Ruppert, et al., Cancer Res. 53:5093,1993; van der Riet, et al., Cancer Res., 54:1156, 1994). More recently,microsatellites have been used to identify specific mutations in certaininherited disorders including Huntington's disease (HD), fragile Xchromosome (FX), myotonic dystrophy (MD), spinocerebellar ataxia type I(SCA1), spino-bulbar muscular dystrophy (SBMA) and hereditarydentatorubralpallidoluysian atrophy (DRPLA) (The Huntington's DiseaseCollaborative Research Group., Cell, 72:971, 1993; E. J. Kremer, et al.,Science, 252:1711, 1991; G. Imbert, et al., Nature Genet., 4:72, 1993);H. T. Orr, et al., Nature Genet., 4:221, 1993); V. Biancalana, et al.,Hum. Mol. Genet., I:255, 1992, M-Y,. Chung, et al, Nature Genet., 5:254,1993, R. Koide, et al., Nature Genet., 6:9, 1994).

Microsatellite instability has recently been described in human cancers.For example, microsatellite instability has been reported to be animportant feature of tumors from hereditary non-polyposis colorectalcarcinoma (HNPCC) patients (Peltomaki, et al., Science, 260:810, 1993;Aaltonen, et al., Science, 260:812, 1993; Thibodeau, et al., Science,260:816, 1993). Moreover, microsatellite instability, demonstrated byexpansion or deletion of repeat elements has been reported incolorectal, endometrial, breast, gastric, pancreatic, bladder neoplastictissues (J. I. Risinger, et al., Cancer Res., 53:5100, 1993; H-J. Han,et al., Cancer Res., 53:5087, 1993; P. Peltomaki, et al., Cancer Res.,53:5853, 1993; M. Gonzalez-Zulueta, et al., Cancer Res., 53:5620, 1993),and recently in SCLC. In HNPCC patients, this genetic instability is dueto inherited and somatic mutations of a critical mismatch repair gene(hMSH-2). Mutations of hMLH-1 and other critical mismatch repair genesmay also be responsible for the instability detected in HNPCC patients.

Cancer remains a major cause of mortality worldwide, and despiteadvancements in diagnosis and treatment, the overall survival rate hasnot improved significantly in the past twenty years. There remains anunfulfilled need for a more sensitive means of early diagnosis. Typicalassays to detect rare infiltrating tumor cells in clinical samplesutilize amplification methods that require additional cloning steps andsynthesis of a large number of oligomer specific probes to detect a widevariety of oncogenic mutations for each tumor type (Sidransky, et al.,Science, 252:706, 1991; Sidransky, et al., Science, 256:102, 1992). Thepresent invention provides a sensitive assay to detect a variety ofcancers using hypermutable microsatellite markers and an amplificationstrategy which eliminates the need for additional cloning steps.

SUMMARY OF THE INVENTION

The present invention provides a fast, reliable, sensitive screeningmethod for the detection of a cell proliferative disorder in variousclinical samples. The invention utilizes amplification of microsatellitenucleic acid (small repeat sequences) to detect a clonal population ofcells in a clinical sample.

The invention is based on the unexpected finding that microsatellitealterations are detectable as a clonal population of cells in the DNA ofcytologic clinical samples. These samples include urine, sputum, andhistopathologic margins obtained from cancer patients.

The invention provides a method for detecting a mammalian cellproliferative disorder (i.e., neoplasia) associated with a hypermutablemammalian target nucleic acid in a specimen, comprising isolating thenucleic acid present in the specimen and detecting the presence of thehypermutable target nucleic acid, typically following amplification ofthe nucleic acid.

In one embodiment, the amplification step in the method of the inventionis performed as a multiplex reaction. Therefore, instead of performingmultiple amplification reactions to identify each clonal alteration,primers for different markers are combined in one simple amplificationreaction, enhancing the identification of a large proportion of cellproliferative disorders.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 shows a denaturing acrylamide gel of normal (N) and tumor (T) DNAfrom lane 1, B17 (TCC) (marker FGA); lane 2, L21 (SCLC) (marker AR);lane 3, B30 (TCC) (marker UT762); and lane 4, L5 (SCLG) (marker D14S50).

FIG. 2 shows a denaturing acrylamide gel of Panel A) a urine (U) sampleof B27 (TCC) analyzed with marker FGA and compared with normal (N) andtumor (T) tissue DNA; Panel B) histological margin (M) of L31 (NSCLC)screened with AR and compared with tumor tissue; and Panel C) sputum (S)sample of L25 (SCLC) analyzed with CHRNB and compared with normal andtumor tissue DNA.

FIG. 3 shows DNA from patients' lymphocytes (N) amplified with markerFGA at varying dilutions with tumor DNA from 1 in 5 (20%) to 1 in 1000(0.1%). Novel bands are indicated by an arrow. Panel A, tumor DNA fromB17 (TCC); panel B, tumor DNA of B27 (TCC).

FIG. 4 shows a multiplex PCR assay utilizing DNA from B27 (TCC) and acorresponding urine sample amplified with three different primer sets(FGA, ACTBP2, AR) in the same PCR reaction (N: normal DNA; T: tumor DNA;U: urine DNA).

DETAILED DESCRIPTION OF THE INVENTION

The present invention relates to a method of detecting a cellproliferative disorder associated with a target nucleic acid having ahypermutable nucleotide sequence. Preferably, the hypermutablenucleotide sequence of the invention is a microsatellite DNA sequence.Microsatellite alterations in various cancer lesions, for example, canbe detected by using known microsatellite repeat markers. A combinationof repeat markers can be utilized in the method of the invention toidentify a large percentage of cell proliferative disorders orneoplasias even if the neoplastic cells comprise only a small fractionof the clinical sample.

As used herein, the term "hypermutable" refers to a nucleic acidsequence that is susceptible to instability, thus resulting in nucleicacid alterations. Such alterations include the deletion and addition ofnucleotides. The hypermutable sequences of the invention are preferablymicrosatellite DNA sequences which, by definition, are small tandemrepeat DNA sequences.

The hypermutable nucleic acid may be a neoplastic nucleic acid sequence.The term "neoplastic" nucleic acid refers to a nucleic acid sequencewhich directly or indirectly is associated with or causes a neoplasm.The method of the invention is applicable to detection of hypermutablenucleotide sequences associated with benign as well as malignant tumors.The method can be used to detect any hypermutable nucleotide sequence,regardless of origin, as long as the sequence is detectably present in aspecimen. The specimen can be blood, urine, sputum, bile, stool,cervical smears, saliva, tears, cerebral spinal fluid, regional lymphnode and histopathologic margins, and any bodily fluid that drains abody cavity or organ. For example, neoplasia of regional lymph nodesassociated with a primary mammary tumor can be detected utilizing themethod of the invention. The term "regional lymph node" refers tolymphoid tissue forming lymphoid organs or nodes which are in closeproximity to the primary tumor. For example, regional lymph nodes in thecase of head and neck carcinomas include cervical lymph nodes,prelaryngeal lymph nodes, pulmonary juxtaesophageal lymph nodes andsubmandibular lymph nodes. Regional lymph nodes for mammary tissuecarcinomas include the axillary and intercostal nodes. The term"external to a primary neoplasm" means that the specimen is taken from asite other than directly from the primary neoplasm itself. Such specimenmay be useful in evaluating whether metastasis of the primary neoplasmhas occurred.

The method can also be used to detect a hypermutable nucleic acidsequence associated with a primary tumor by assaying the surroundingtumor margin. As used herein the term "tumor margin" refers to thetissue surrounding a discernible tumor. In the case of surgical removalof a solid tumor, the tumor margin is the tissue cut away with thediscernible tumor that usually appears to be normal to the naked eye.More particularly, as used herein, "margin" refers to the edge, border,or boundary of a tumor. The margin generally extends from about 0.2 cmto about 3 cm from the primary tumor but can be greater depending uponthe size of the primary solid tumor.

In its broadest sense, the present invention allows the detection of anyhypermutable target nucleic acid sequence of diagnostic or therapeuticrelevance, where the target nucleic acid sequence is present in a tissuesample. The target nucleotide sequence may be, for example, arestriction fragment length polymorphism (RFLP), nucleotide deletion,nucleotide addition, or any other mammalian nucleic acid sequence ofinterest in such tissue specimens. Preferably, the microsatellite,hypermutable nucleic acid of the invention contains nucleic aciddeletions or additions.

The hypermutable microsatellites most preferred in the method of theinvention comprises the sequence (X)_(n), wherein X is the number ofnucleotides in the repeat sequence and is greater than or equal to 1,preferably greater than or equal to 2, and most preferably greater thanor equal to 3 and wherein n is the number of repeats and is greater thanor equal to 2, and preferably from 4 to 6. Preferably, when X is 2, thenucleotide sequence is TC. Preferably, when X is 3, the nucleotidesequence is selected from AGC, TCC, CAG, CAA, and CTG. Preferably when Xis 4, the nucleotide sequence is selected from AAAG, AGAT and TCTT.

The hypermutable nucleic acid sequence is preferably associated with aknown locus. For example, hypermutable microsatellite alterations may bedetected using a marker selected from ARA (chromosome X), D14S50(chromosome 14), AR (chromosome X), MD (chromosome 19), SAT (chromosome6), DRPLA (chromosome 12), ACTBP2 (chromosome 6), FGA (chromosome 4),D4S243 (chromosome 4), and UT762 (chromosome 21). Tandem repeatsequences have been identified as associated with Huntington's disease(HD), fragile X syndrome (FX), myotonic dystrophy (MD), spinocerebellarataxia type I (SCA1), spino-bulbar muscular dystrophy, and hereditarydentatorubralpallidoluysian atrophy (DRPLA). When the nucleotidesequence of X is larger, it is more likely that the microsatellite locuswill have alterations, e.g., a trinucleotide repeat is more likely tohave deletions or additions than a dinucleotide repeat. Thus, in thepresent invention, it has been found that 8% of 25 trinucleotide ortetranucleotide markers displayed microsatellite alternations per 100tumor specimens examined, whereas only 0.7% of 83 dinucleotidemicrosatellite markers were found to contain alterations. In addition,it has been found that a regular repeat, such as AAT AAT AAT is morelikely to be hypermutable than a sequence which contains interruptionsin the repeat sequence, e.g., AAT GAC AAT AAT. Consequently, those ofordinary skill in the art can readily identify other hypermutable targetnucleic acid sequences by considering the size of the candidate sequenceand whether the sequence is uninterrupted without resorting to undueexperimentation. Other microsatellite markers will be known by thecriteria described herein and are accessible to those of skill in theart. Smaller microsatellite markers including dinucleotide andmononucleotide repeats may be hypermutable and useful for this analysis.

The present invention identifies hypermutable target sequences,preferably microsatellite loci, that are unique to a particular cellularproliferative disorder, primary tumor, or metastatic sites derived fromthe primary tumor. In the tumor cell, the hypermutable nucleotidesequence is evidenced by nucleic acid deletions or expansion of repeatsequences as compared to a normal cell; therefore, it is possible todesign appropriate diagnostic techniques directed to the specificsequence and to design therapeutic strategies once the sequence isidentified.

The term "cell-proliferative disorder" denotes benign as well asmalignant cell populations which morphologically often appear to differfrom the surrounding tissue. For example, the method of the invention isuseful in detecting malignancies of the various organ systems, such as,for example, lung, breast, lymphoid, gastrointestinal, andgenito-urinary tract as well as epithelial carcinomas which includemalignancies such as most colon cancers, renal-cell carcinoma, prostatecancer, non-small cell carcinoma of the lung, cancer of the smallintestine, cancer of the head and neck, stomach cancer, bladder cancer,kidney cancer, cervical cancer, cancer of the esophagus and any otherorgan type that has a draining fluid or tissue accessible to analysis.The method of the invention is also useful in detecting non-malignantcell-proliferative diseases such as colon adenomas, hyperplasia,dysplasia and other "pre-malignant" lesions. Essentially, any disorderwhich is etiologically linked to a hypermutable microsatellite locuswould be considered susceptible to detection.

When it is desired to amplify the target nucleotide sequence beforedetection, such as a hypermutable nucleotide sequence, this can beaccomplished using oligonucleotides that are primers for amplification.The oligonucleotide primers are designed based upon identification ofthe nucleic acid sequence of the flanking regions contiguous with thehypermutable nucleotide sequence. For example, in the case ofhypermutable microsatellite nucleic acid sequences, oligonucleotideprimers comprise sequences which are capable of hybridizing withnucleotide sequences flanking the loci of mutations, such as thefollowing nucleotide sequences:

    ______________________________________    a.    5'-CTT GTG TCC CGG CGT CTG-3'          (SEQ ID NO: 1);    b.    5'-C AGC CCA GCA GGA CCA GTA-3'          (SEQ ID NO: 2);    c.    5'-TGG TAA CAG TGG ATT ACT GAC-3'          (SEQ ID NO: 3);    d.    5'-ACT GAT GCA AAA ATC CTC AAC-3'          (SEQ ID NO: 4);    e.    5'-GA TGG GCA AAC TGC AGG CCT GGG AAG-3'          (SEQ ID NO: 5);    f.    5'-GCT ACA AGG ACC CTT CGA GCC CCG TTG-3'          (SEQ ID NO: 6);    g.    5'GAT GGT GAT GTG TTG AGA CTG GTG-3'          (SEQ ID NO: 7);    h.    5'-GAG CAT TTC CCC ACC CAC TGG AGG-3'          (SEQ ID NO: 8);    i.    5'-GTT CTG GAT CAC TTC GCG GA-3'          (SEQ ID NO: 9);    j.    5'-TGA GGA TGG TTC TCC CCA AG-3'          (SEQ ID NO: 10);    k.    5'-AGT GGT GAA TTA GGG GTG TT-3'          (SEQ ID NO: 11);    l.    5'-CTG CCA TCT TGT GGA ATC AT-3'          (SEQ ID NO: 12);    m.    5'-CTG TGA GTT CAA AAC CTA TGG-3'          (SEQ ID NO: 13);    n.    5'-GTG TCA GAG GAT CTG AGA AG-3'          (SEQ ID NO: 14);    o.    5'-GCA CGC TCT GGA ACA GAT TCT GGA-3'          (SEQ ID NO: 15);    p.    5'-ATG AGG AAC AGC AAC CTT CAC AGC-3'          (SEQ ID NO: 16);    q.    5'-TCA CTC TTG TCG CCC AGA TT-3'          (SEQ ID NO: 17);    r.    5'-TAT AGC GGT AGG GGA GAT GT-3'          (SEQ ID NO: 18);    s.    5'-TGC AAG GAG AAA GAG AGA CTG A-3'          (SEQ ID NO: 19);    t.    5'-AAC AGG ACC ACA GGC TCC TA-3'          (SEQ ID NO: 20); and    u.    sequences complementary to sequences a. through t.    ______________________________________

Primers that hybridize to these flanking sequences are, for example, thefollowing:

    ______________________________________    a.    5'-CAG ACG CCG GGA CAC AAG-3'          (SEQ ID NO: 21);    b.    5'-TAC TGG TCC TGC TGG GCT G-3'          (SEQ ID NO: 22);    c.    5'-GTC AGT ATT ACC CTG TTA CCA-3'          (SEQ ID NO: 23);    d.    5'-GTT GAG GAT TTT TGC ATC AGT-3'          (SEQ ID NO: 24);    e.    5'-CTT CCC AGG CCT GCA GTT TGC CCA TC-3'          (SEQ ID NO: 25);    f.    5'-GAA CGG GGC TCG AAG GGT CCT TGT AGC-3'          (SEQ ID NO: 26);    g.    5'-CAC CAG TCT CAA CAC ATC ACC ATC-3'          (SEQ ID NO: 27);    h.    5'-CCT CCA GTG GGT GGG GAA ATG CTC-3'          (SEQ ID NO: 28);    i.    5'-TCC GCG AAG TGA TCC AGA AC-3'          (SEQ ID NO: 29);    j.    5'-CTT GGG GAG AAC CAT CCT CA-3'          (SEQ ID NO: 30);    k.    5'-AAC ACC CCT AAT TCA CCA CT-3'          (SEQ ID NO: 31);    l.    5'-ATG ATT CCA CAA GAT GGC AG-3'          (SEQ ID NO: 32);    m.    5'-CCA TAG GTT TTG AAC TCA CAG-3'          (SEQ ID NO: 33);    n.    5'-CTT CTC AGA TCC TCT GAC AC-3'          (SEQ ID NO: 34);    o.    5'-TCC AGA ATC TGT TCC AGA GCG TGC-3'          (SEQ ID NO: 35);    p.    5'-GCT GTG AAG GTT GCT GTT CCT CAT-3'          (SEQ ID NO: 36);    q.    5'-AAT CTG GGC GAC AAG AGT GA-3'          (SEQ ID NO: 37);    r.    5'-ACA TCT CCC CTA CCG CTA TA-3'          (SEQ ID NO: 38);    s.    5'-TCA GTC TCT CTT TCT CCT TGC A-3'          (SEQ ID NO: 39);    t.    5'-TAG GAG CCT GTG GTC CTG TT-3'          (SEQ ID NO: 40); and    u.    sequences complementary to sequences a. through t.    ______________________________________

One skilled in the art will be able to generate primers suitable foramplifying target sequences of additional nucleic acids, such as thoseflanking loci of known microsatellite sequences, using routine skillsknown in the art and the teachings of this invention.

In general, the primers used according to the method of the inventionembrace oligonucleotides of sufficient length and appropriate sequencewhich provide specific initiation of polymerization of a significantnumber of nucleic acid molecules containing the target nucleic acidunder the conditions of stringency for the reaction utilizing theprimers. In this manner, it is possible to selectively amplify thespecific target nucleic acid sequence containing the nucleic acid ofinterest. Specifically, the term "primer" as used herein refers to asequence comprising two or more deoxyribonucleotides or ribonucleotides,preferably at least eight, which sequence is capable of initiatingsynthesis of a primer extension product that is substantiallycomplementary to a target nucleic acid strand. The oligonucleotideprimer typically contains 15-22 or more nucleotides, although it maycontain fewer nucleotides as long as the primer is of sufficientspecificity to allow essentially only the amplification of thespecifically desired target nucleotide sequence (i.e., the primer issubstantially complementary).

Experimental conditions conducive to synthesis include the presence ofnucleoside triphosphates and an agent for polymerization, such as DNApolymerase, and a suitable temperature and pH. The primer is preferablysingle stranded for maximum efficiency in amplification, but may bedouble stranded. If double stranded, the primer is first treated toseparate its strands before being used to prepare extension products.Preferably, the primer is an oligodeoxyribonucleotide. The primer mustbe sufficiently long to prime the synthesis of extension products in thepresence of the inducing agent for polymerization. The exact length ofprimer will depend on many factors, including temperature, buffer, andnucleotide composition.

Primers used according to the method of the invention are designed to be"substantially" complementary to each strand of mutant nucleotidesequence to be amplified. Substantially complementary means that theprimers must be sufficiently complementary to hybridize with theirrespective strands under conditions which allow the agent forpolymerization to function. In other words, the primers should havesufficient complementarily with the flanking sequences to hybridize withand permit amplification of the nucleotide sequence. Preferably, the 3'terminus of the primer that is extended has perfectly base pairedcomplementarity with the complementary flanking strand.

Oligonucleotide primers used according to the invention are employed inany amplification process that produces increased quantities of targetnucleic acid. Typically, one primer is complementary to the negative (-)strand of the mutant nucleotide sequence and the other is complementaryto the positive (+) strand. Annealing the primers to denatured nucleicacid followed by extension with an enzyme, such as the large fragment ofDNA Polymerase I (Klenow) or Taq DNA polymerase and nucleotides orligases, results in newly synthesized (+) and (-) strands containing thetarget nucleic acid. Because these newly synthesized nucleic acids arealso templates, repeated cycles of denaturing, primer annealing, andextension results in exponential production of the region (i.e., thetarget hypermutable nucleotide sequence) defined by the primer. Theproduct of the amplification reaction is a discrete nucleic acid duplexwith termini corresponding to the ends of the specific primers employed.Those of skill in the art will know of other amplification methodologieswhich can also be utilized to increase the copy number of target nucleicacid.

The oligonucleotide primers for use in the invention may be preparedusing any suitable method, such as conventional phosphotriester andphosphodiester methods or automated embodiments thereof. In one suchautomated embodiment, diethylphosphoramidites are used as startingmaterials and may be synthesized as described by Beaucage, et al.(Tetrahedron Letters, 22:1859-1862, 1981). One method for synthesizingoligonucleotides on a modified solid support is described in U.S. Pat.No. 4,458,066. One method of amplification which can be used accordingto this invention is the polymerase chain reaction (PCR) described inU.S. Pat. Nos. 4,683,202 and 4,683,195.

The nucleic acid from any specimen, in purified or nonpurified form, canbe utilized as the starting nucleic acid or acids, provided it contains,or is suspected of containing, the specific nucleic acid sequencecontaining the target nucleic acid. Thus, the process may employ, forexample, DNA or RNA, including messenger RNA (mRNA), wherein DNA or RNAmay be single stranded or double stranded. In the event that RNA is tobe used as a template, enzymes, and/or conditions optimal for reversetranscribing the template to DNA would be utilized. In addition, aDNA-RNA hybrid which contains one strand of each may be utilized. Amixture of nucleic acids may also be employed, or the nucleic acidsproduced in a previous amplification reaction herein, using the same ordifferent primers may be so utilized. The mutant nucleotide sequence tobe amplified may be a fraction of a larger molecule or can be presentinitially as a discrete molecule, such that the specific sequenceconstitutes the entire nucleic acid. It is not necessary that thesequence to be amplified be present initially in a pure form; it may bea minor fraction of a complex mixture, such as contained in whole humanDNA.

Where the target neoplastic nucleotide sequence of the sample containstwo strands, it is necessary to separate the strands of the nucleic acidbefore it can be used as the template. Strand separation can be effectedeither as a separate step or simultaneously with the synthesis of theprimer extension products. This strand separation can be accomplishedusing various suitable denaturing conditions, including physical,chemical, or enzymatic means; the word "denaturing" includes all suchmeans. One physical method of separating nucleic acid strands involvesheating the nucleic acid until it is denatured. Typical heatdenaturation may involve temperatures ranging from about 80° to 105° C.for times ranging from about 1 to 10 minutes. Strand separation may alsobe induced by an enzyme from the class of enzymes known as helicases orby the enzyme RecA, which has helicase activity, and in the presence ofriboATP which is known to denature DNA. The reaction conditions suitablefor strand separation of nucleic acids with helicases are described byKuhn Hoffmann-Berling (CSH-Quantitative Biology, 43:63, 1978) andtechniques for using RecA are reviewed in C. Radding (Ann. Rev.Genetics, 16:405-437, 1982).

If the nucleic acid containing the target nucleic acid to be amplifiedis single stranded, its complement is synthesized by adding one or twooligonucleotide primers. If a single primer is utilized, a primerextension product is synthesized in the presence of primer, an agent forpolymerization, and the four nucleoside triphosphates described below.The product will be complementary to the single-stranded nucleic acidand will hybridize with a single-stranded nucleic acid to form a duplexof unequal length strands that may then be separated into single strandsto produce two single separated complementary strands. Alternatively,two primers may be added to the single-stranded nucleic acid and thereaction carried out as described.

When complementary strands of nucleic acid or acids are separated,regardless of whether the nucleic acid was originally double or singlestranded, the separated strands are ready to be used as a template forthe synthesis of additional nucleic acid strands. This synthesis isperformed under conditions allowing hybridization of primers totemplates. Generally synthesis occurs in a buffered aqueous solution,preferably at a pH of 7-9, most preferably about 8. Preferably, a molarexcess (for genomic nucleic acid, usually about 10⁸ :1 primer:template)of the two oligonucleotide primers is added to the buffer containing theseparated template strands. It is understood, however, that the amountof complementary strand may not be known if the process of the inventionis used for diagnostic applications, so that the amount of primerrelative to the amount of complementary strand cannot be determined withcertainty. As a practical matter, however, the amount of primer addedwill generally be in molar excess over the amount of complementarystrand (template) when the sequence to be amplified is contained in amixture of complicated long-chain nucleic acid strands. A large molarexcess is preferred to improve the efficiency of the process.

In some amplification embodiments, the substrates, for example, thedeoxyribonucleotide triphosphates dATP, dCTP, dGTP, and dTTP, are addedto the synthesis mixture, either separately or together with theprimers, in adequate amounts and the resulting solution is heated toabout 90°-100° C. from about 1 to 10 minutes, preferably from 1 to 4minutes. After this heating period, the solution is allowed to cool toroom temperature, which is preferable for the primer hybridization. Tothe cooled mixture is added an appropriate agent for effecting theprimer extension reaction (called herein "agent for polymerization"),and the reaction is allowed to occur under conditions known in the art.The agent for polymerization may also be added together with the otherreagents if it is heat stable. This synthesis (or amplification)reaction may occur at room temperature up to a temperature above whichthe agent for polymerization no longer functions. Thus, for example, ifDNA polymerase is used as the agent, the temperature is generally nogreater than about 40° C. The agent for polymerization may be anycompound or system which will function to accomplish the synthesis ofprimer extension products, including enzymes. Suitable enzymes for thispurpose include, for example, E. coli DNA polymerase I, Taq polymerase,Klenow fragment of E. coli DNA polymerase I, T4 DNA polymerase, otheravailable DNA polymerases, polymerase muteins, reverse transcriptase,ligase, and other enzymes, including heat-stable enzymes (i.e., thoseenzymes which perform primer extension after being subjected totemperatures sufficiently elevated to cause denaturation). Suitableenzymes will facilitate combination of the nucleotides in the propermanner to form the primer extension products which are complementary toeach mutant nucleotide strand. Generally, the synthesis will beinitiated at the 3' end of each primer and proceed in the 5' directionalong the template strand, until synthesis terminates, producingmolecules of different lengths. There may be agents for polymerization,however, which initiate synthesis at the 5' end and proceed in the otherdirection, using the same process as described above. In any event, themethod of the invention is not to be limited to the embodiments ofamplification which are described herein.

The newly synthesized mutant nucleotide strand and its complementarynucleic acid strand will form a double-stranded molecule underhybridizing conditions described above and this hybrid is used insubsequent steps of the process. In the next step, the newly synthesizeddouble-stranded molecule is subjected to denaturing conditions using anyof the procedures described above to provide single-stranded molecules.

The above process is repeated on the single-stranded molecules.Additional agent for polymerization, nucleotides, and primers may beadded, if necessary, for the reaction to proceed under the conditionsprescribed above. Again, the synthesis will be initiated at one end ofeach of the oligonucleotide primers and will proceed along the singlestrands of the template to produce additional nucleic acid. After thisstep, half of the extension product will consist of the specific nucleicacid sequence bounded by the two primers.

The steps of denaturing and extension product synthesis can be repeatedas often as needed to amplify the target hypermutable nucleotidesequence to the extent necessary for detection. The amount of thehypermutable nucleotide sequence produced will accumulate in anexponential fashion.

In one embodiment of the invention, a combination of hypermutablemicrosatellite markers are amplified in a single amplification reaction.The markers are "multiplexed" in a single amplification reaction, forexample, by combining primers for more than one locus. For example, DNAfrom a urine sample is amplified with three different randomly labelledprimer sets such as FGA, ACTBP2 and AR, in the same amplificationreaction. The products are ultimately separated on a denaturingacrylamide gel and then exposed to film for visualization and analysis.

The amplified product may be detected by Southern blot analysis, withoutusing radioactive probes. In such a process, for example, a small sampleof DNA containing a very low level of microsatellite hypermutablenucleotide sequence is amplified, and analyzed via a Southern blottingtechnique. The use of non-radioactive probes or labels is facilitated bythe high level of the amplified signal. In a preferred embodiment of theinvention, one nucleoside triphosphate is radioactively labeled, therebyallowing direct visualization of the amplification product byautoradiography. In another embodiment, amplification primers arefluorescent labelled and run through an electrophoresis system.Visualization of amplified products is by laser detection followed bycomputer assisted graphic display.

Nucleic acids having a hypermutable microsatellite sequence detected inthe method of the invention can be further evaluated, detected, cloned,sequenced, and the like, either in solution or after binding to a solidsupport, by any method usually applied to the detection of a specificDNA sequence such as PCR, oligomer restriction (Saiki, et al.,Bio/Technology, 3:1008-1012, 1985), allele-specific oligonucleotide(ASO) probe analysis (Conner, et al., Proc. Natl. Acad. Sci. USA,80:278, 1983), oligonucleotide ligation assays (OLAs) (Landegren, etal., Science, 241:1077, 1988), and the like. Molecular techniques forDNA analysis have been reviewed (Landegren, et al., Science,242:229-237, 1988).

In another embodiment of the invention, purified nucleic acid fragmentsthat contain oligonucleotide sequences of 10-50 bases frommicrosatellite loci, are radioactively labelled. The labelledpreparations are used to probe nucleic acid by the Southernhybridization technique. Nucleotide fragments from a specimen, before orafter amplification, are separated into fragments of different molecularmasses by gel electrophoresis and transferred to filters which bindnucleic acid. After exposure to the labelled probe, which will hybridizeto nucleotide fragments containing target nucleic acid sequences,binding of the radioactive probe to target nucleic acid fragments isidentified by autoradiography (see Genetic Engineering, 1, ed. RobertWilliamson, Academic Press, (1981), 72-81).

Probes for microsatellite loci of the present invention can be used forexamining the distribution of the specific fragments detected, as wellas the quantitative (relative) degree of binding of the probe fordetermining the occurrence of specific strongly binding (hybridizing)sequences, thus indicating the presence of extensive alterations in aparticular locus.

For the most part, the probe (or amplification primer) will be labelledwith an atom or inorganic radical, most commonly using radionuclides,but also perhaps heavy metals. Conveniently, a radioactive label may beemployed. Radioactive labels include ³² P, ¹²⁵ I, ³ H, ¹⁴ C, ³⁵ S, orthe like. Any radioactive label may be employed which provides for anadequate signal and has sufficient half-life. Other labels includeligands, which can serve as a specific binding pair member for alabelled ligand, and the like. A wide variety of labels have beenemployed in immunoassays which can readily be employed in the presentassay. The choice of the label will be governed by the effect of thelabel on the rate of hybridization and binding of the probe to mutantnucleotide sequence. It will be necessary that the label providesufficient sensitivity to detect the amount of mutant nucleotidesequence available for hybridization. Other considerations will be easeof synthesis of the probe, readily available instrumentation, ability toautomate, convenience, and the like.

The manner in which the label is bound to the probe will vary dependingupon the nature of the label. For a radioactive label, a wide variety oftechniques can be employed. Commonly employed is nick translation withan α-³² P-dNTP or terminal phosphate hydrolysis with alkalinephosphatase followed by labeling with radioactive ³² P employing γ-³²P-ATP and T4 polynucleotide kinase. Alternatively, nucleotides can besynthesized where one or more of the elements present are replaced witha radioactive isotope, e.g., hydrogen with tritium. If desired,complementary labelled strands can be used as probes to enhance theconcentration of hybridized label.

Where other radionuclide labels are involved, various linking groups canbe employed. A terminal hydroxyl can be esterified, with inorganicacids, e.g., ³² P phosphate, or ¹⁴ C organic acids, or else esterifiedto provide linking groups to the label. Alternatively, intermediatebases may be substituted with activatable linking groups which can thenbe linked to a label.

Enzymes of interest as reporter groups will primarily be alkalinephosphatase, hydrolases, particularly esterases and glycosidases, oroxidoreductases, particularly peroxidases. Fluorescent compounds includefluorescein and its derivatives, rhodamine and its derivatives, dansyl,umbelliferone, and so forth.

Chemiluminescers include, for example, luciferin, and 2,3-dihydrophthalazinediones (e.g., luminol).

An oligomer probe can be employed for hybridizing to a nucleotidesequence affixed to a water insoluble porous support. Depending upon thesource of the nucleic acid, the manner in which the nucleic acid isaffixed to the support may vary. Those of ordinary skill in the artknow, or can easily ascertain, different supports which can be used inthe method of the invention.

The nucleic acid from a specimen is spotted or spread onto a filter toprovide a plurality of individual portions. The filter is an inertporous solid support, e.g., nitrocellulose or nylon membranes. Anymammalian cells present in the specimen are treated to liberate theirnucleic acid. The lysing and denaturation of nucleic acid, as well asthe subsequent washings, can be achieved with an appropriate solutionfor a sufficient time to lyse the cells and denature the nucleic acid.Other denaturation agents include elevated temperatures, organicreagents (e.g., alcohols, amides, amines, ureas, phenols and sulfoxides)or certain inorganic ions (e.g., thiocyanate and perchlorate).Alternatively, nucleic acid can be isolated from blood using standardprocedures with the DNA subsequently applied to the membrane.

After denaturation, the filter is washed in an aqueous bufferedsolution, such as Tris, generally at a pH of about 6 to 8, usually 7.One or more washings may be involved, conveniently using the sameprocedure as employed for the lysing and denaturation. After the lysing,denaturing, and washes have been accomplished, the nucleic acid spottedfilter is dried at an elevated temperature, generally from about 50° C.to 70° C. or UV-crosslinked (for nylon membranes). Under this procedure,the nucleic acid is fixed in position and can be assayed with the probewhen convenient.

Pre-hybridization may be accomplished by incubating the filter at amildly elevated temperature for a sufficient time with the hybridizationsolution without the probe to thoroughly wet the filter. Varioushybridization solutions may be employed, comprising from about 20% to60% volume, preferably 30%, of an inert polar organic solvent or aqueoushybridization solutions.

The particular hybridization technique is not essential to theinvention. Other hybridization techniques are well known or easilyascertained by one of ordinary skill in the art. As improvements aremade in hybridization techniques they can readily be applied in themethod of the invention.

The amount of labelled probe which is present in the hybridizationsolution will vary widely, depending upon the nature of the label, theamount of the labelled probe which can reasonably bind to the filter,and the stringency of the hybridization. Generally, substantial excessover stoichiometric concentrations of the probe will be employed toenhance the rate of binding of the probe to the fixed target nucleicacid.

Various degrees of stringency of hybridization may be employed. The morestringent the conditions, the greater the complementarity that isrequired for hybridization between the probe and the single strandedtarget nucleic acid sequence for duplex formation. Stringency can becontrolled by temperature, probe concentration, probe length, ionicstrength, time, and the like. Conveniently, the stringency ofhybridization is varied by changing the polarity of the reactantsolution by manipulating the concentration of formamide in the range of20% to 50%. Temperatures employed will normally be in the range of about20° C. to 80° C., usually 30° C. to 75° C. (see, generally, CurrentProtocols in Molecular Biology, Ausubel, ed., Wiley & Sons, 1989).Alternatively, stringency can be controlled when the non-annealed probeis washed off.

After the filter has been contacted with a hybridization solution at amoderate temperature for a period of time sufficient to allowhybridization to occur, the filter is then introduced into a secondsolution having sodium chloride, sodium citrate and sodiumdodecylsulfate. The time for which the filter is maintained in thesecond solution may vary from five minutes to three hours or more. Thesecond solution and the temperature (generally 50° C. below the meltingtemperature) determines the stringency, dissolving duplexes, and shortcomplementary sequences. For short oligonucloetide probes, the meltingtemperature can be standarized according to probe length, rather thansequence, by including tetramethyl ammonium chloride in the washsolution (DiLella and Woo, Meth. Enzymol., 152:447, 1987). The filtermay now be assayed for the presence of duplexes in accordance with thenature of the label. Where the label is radioactive, the filter is driedand exposed to X-ray film.

The materials for use in the assay of the invention are ideally suitedfor the preparation of a kit. Such a kit may comprise a carrier meansbeing compartmentalized to receive in close confinement one or morecontainer means such as vials, tubes, and the like, each of thecontainer means comprising one of the separate elements to be used inthe method.

For example, one of the container means may comprise amplificationprimers for a microsatellite locus or a hybridization probe, all ofwhich can be detectably labelled. If present, a second container maycomprise a lysis buffer. The kit may also have containers containingnucleotide(s) for amplification of the target nucleic acid sequencewhich may or may not be labeled, and/or a container comprising areporter-means, such as a biotin-binding protein, such as avidin orstreptavidin, bound to a reporter molecule, such as an enzymatic,florescent, or radionuclide label.

The above disclosure generally describes the present invention. A morecomplete understanding can be obtained by reference to the followingspecific examples which are provided herein for purposes of illustrationonly and are not intended to limit the scope of the invention.

EXAMPLES

During primary mapping studies using 43 dinucleotide markers in over 300tumors including SCC, non-small cell lung carcinoma (NSCLC),transitional cell carcinoma of the bladder (TCC), and squamous and basalcell skin carcinoma, single somatic alterations were observed inapproximately 0.7% of tumor DNA This low rate of "background"alterations in dinucleotide markers is consistent with other studiesdemonstrating few changes in non-HNPCC-associated tumors (S. N.Thibodeau, et al., Science, 260:816, 1993; J. I. Risinger, et al.,Cancer Res., 53:5100, 1993; H-J. Han, et al., Cancer Res., 53:5087,1993); P. Peltomaki, et al., Cancer Res., 53: 5853, 1993); M.Gonzalez-Zulueta, et al., Cancer Res., 53:5620, 1993; R. Wooster, etal., Nature Gent., 6:152,1994) and germine DNA (J. L. Weber, et al., Am.J. Human Genet., 44:388, 1989), J. Weissenback, et al., Nature, 359:794,1992; (A. J. Jeffreys, et al, Nature Genet., 6:136, 1994). 35 SCC, 20NSCLC, 10 SCLC, and 32 TCC normal and cancer DNA pairs were tested usingtri- and tetranucleotide repeat markers (TABLE 1). Of these,approximately 8% and 20% of tumor DNA had somatic alterations using tri-and tetranucleotide repeat-markers, respectively. Of the nine markers,four (MD, DRPLA, AR (SBMA), and SAT (SCA1)) were associated withneurological disease and have revealed germine expansion of their repeatsequences in affected patients. Because of these germline alterations,it was thought that these markers might be more susceptible to expansionor deletion in tumor DNA as compared with dinucleotide repeat sequences.The other markers were chosen among commercially available tri- ortetranucleotide repeat markers except for UT762 (on human chromosome21), which was previously reported to have an excess of germlinealterations on human chromosome 21 (C. C. Talbot, Jr., et al., 43rdAmerican Society of Human Genetics Meeting, Abstract, 1993).

EXAMPLE 1 PRIMERS UTILIZED FOR AMPLIFICATION OF SPECIMEN DNA

All tumors were fresh-frozen except for SCLC which wasparaffin-embedded. Non-neoplastic tissue was microdissected away to beused as normal DNA. Alternatively, fresh blood was obtained andlymphocytes separated. Tumor and normal tissue was digested with 1%SDS-proteinase K followed by ethanol precipitation to extract DNA. 50 ngof DNA was subject to PCR amplification; the products were run ondenaturing acrylamide gels as described previously (P. van der Riet, etal., Cancer Res., 54:1156, 1994), H. Nawroz, P., Cancer Res., 54:1152,1994; P. Cairns, et al., Cancer Res., 54:1422, 1994).

Primers used to amplify each locus were obtained from Research Genetics,Inc., with the exception of the following loci:

    ______________________________________    a.    5'-CAG ACG CCG GGA CAC AAG-3'          (SEQ ID NO: 21);    b.    5'-TAC TGG TCC TGC TGG GCT G-3'          (SEQ ID NO: 22);    c.    5'-GTC AGT ATT ACC CTG TTA CCA-3'          (SEQ ID NO: 23);    d.    5'-GTT GAG GAT TTT TGC ATC AGT-3'          (SEQ ID NO: 24);    e.    5'-CTT CCC AGG CCT GCA GTT TGC CCA TC-3'          (SEQ ID NO: 25);    f.    5'-GAA CGG GGC TCG AAG GGT CCT TGT AGC-3'          (SEQ ID NO: 26);    g.    5'-CAC CAG TCT CAA CAC ATC ACC ATC-3'          (SEQ ID NO: 27);    h.    5'-CCT CCA GTG GGT GGG GAA ATG CTC-3'          (SEQ ID NO: 28);    i.    5'-TCC GCG AAG TGA TCC AGA AC-3'          (SEQ ID NO: 29);    j.    5'-CTT GGG GAG AAC CAT CCT CA-3'          (SEQ ID NO: 30);    k.    5'-AAC ACC CCT AAT TCA CCA CT-3'          (SEQ ID NO: 31);    l.    5'-ATG ATT CCA CAA GAT GGC AG-3'          (SEQ ID NO: 32);    m.    5'-CCA TAG GTT TTG AAC TCA CAG-3'          (SEQ ID NO: 33);    n.    5'-CTT CTC AGA TCC TCT GAC AC-3'          (SEQ ID NO: 34);    o.    5'-TCC AGA ATC TGT TCC AGA GCG TGC-3'          (SEQ ID NO: 35);    p.    5'-GCT GTG AAG GTT GCT GTT CCT CAT-3'          (SEQ ID NO: 36);    q.    5'-AAT CTG GGC GAC AAG AGT GA-3'          (SEQ ID NO: 37);    r.    5'-ACA TCT CCC CTA CCG CTA TA-3'          (SEQ ID NO: 38);    s.    5'-TCA GTC TCT CTT TCT CCT TGC A-3'          (SEQ ID NO: 39);    t.    5'-TAG GAG CCT GTG GTC CTG TT-3'          (SEQ ID NO: 40); and    u.    sequences complementary to sequences a. through t.    ______________________________________

P. Modrich, Annu. Rev. Genet, 25:229, (1991). Cytological samples werespun at 3000×g for 5', and washed with PBS twice. Cell pellets weredigested with 1% SDS-proteinase K, and DNA extracted as describedpreviously (D. Sidransky, et al., Science, 252:706, 1991), Science,256:102, 1992; Surgical margin DNA was obtained from slides which werehistopathologically negative. Tissue was scraped and placed in xylene toremove excess paraffin. After centrifugation with one quarter volume 70%ethanol, pellets were digested and DNA extracted as described previously(D. Sidransky, supra, R. H. Hruban, supra, L. Mao, supra).

EXAMPLE 2 DETECTION OF ALTERATIONS IN MICROSATELLITE LOCI

Each microsatellite locus was amplified in paired normal/tumor DNA byPCR and labelled products were then run on denaturing acrylamide gelsand exposed to film. 29% of head and neck cancers, 5% of NSCLC, 50% ofSCLC, and 28% of bladder tumors exhibited microsatellite alterations inat least one susceptible marker (TABLE 1). These genetic alterationswere identified as a novel band (or bands) in the tumor DNA lane andwere not present in the paired normal DNA lane (FIG. 1). FIG. 1 showsmicrosatellite alterations in tumor DNA. Normal and tumor DNA wereamplified by PCR and run on denaturing acrylamide gels as described (P.van der Riet, supra). Novel bands representing deletion or expansion oftandem repeat sequences were seen in all four tumor lanes as indicatedby the arrows. Lane 1) B17 (TCC) with marker FGA on chromosome 4; Lane2) L21 (SCLC) with marker AR on chromosome X; Lane 3) B30 (TCC) withmarker UT762 on chromosome 21; and Lane 4) L5 (SCLC) with marker D14S50on chromosome 14. (N: normal DNA. T: tumor DNA).

For each case, the amplification was repeated and the alterationsreproduced. The frequency of alterations was significantly higher inthese tri- or tetranucleotide repeat markers and also appeared to betumor-type specific (TABLE 1). In TCC, for example, the AR repeat isaltered in 3% of tumors whereas 18% of SCC's displayed alterations atthis locus. A significant difference in the frequency of alterationsbetween disease and non-disease related tri- and tetranucleotide repeatsequences was not observed suggesting that there might exist a moregeneralized cellular mechanism for these changes, rather than inherentsequence differences in the repeat regions.

SCLC displayed generalized microsatellite instability similar to thatseen in HNPCC-associated tumors and altered a high percentage of allmarkers including dinucleotides (A. Merlo, et al., supra).

Although widespread microsatellite instability has been found mostfrequently in HNPCC, other non-HNPCC tumors contain occasionalalterations. These changes usually involve only one locus, rather thanthe multiple loci typically seen in HNPCC patients (A. Merlo, et al.,supra; R. Wooster, et al., supra). The widespread instability found inSCLC is an exception, and the genetic basis for this is still unknown.In this study, none of the tumors we tested were HNPCC-associated.Comparison of dinucleotide repeat (29 of 4171 tested) versus tri- ortetranucleotide repeat alterations (44 of 874 tested) in these 97 tumorsreveals a significant susceptibility to genetic instability in thelarger alleles (p=0.08×10⁻⁹ by x² analysis). The high frequency ofmicrosatellite alterations found here suggests that certain loci may beinherently more unstable than others. The mechanism producing thealtered alleles in these tumors may differ from that described in HNPCC(F. Leach, et al., Cell, 75:1215, 1993; R. Fishel, et al., Cell,75:1027, 1993) or could reflect more subtle defects in similar orrelated repair pathways (P. Modrich, Annu. Rev. Genet., 25:229, 1991).From these data, it appears that occasional microsatellite alterationsmay be a relatively generally phenomenon in many human cancers with someloci altered in a tumor-type specific manner.

                                      TABLE 1    __________________________________________________________________________    FREQUENCY OF MICROSATELLITE ALTERATIONS                        HEAD AND NECK    BLADDER           NAME   REPEAT                        SCC      NSCLC                                     SCLC                                         TCC   Total    REPEAT SIZE           (CHROMO)                  SEQUENCE                        (n = 35) (n = 23)                                     (n = 10)                                         (N = 32)                                               (n = 100)    __________________________________________________________________________    Trinucleotide           ARA (X)                  (AGC)  2 (6%)  1 (4%)                                     --  1 (3%)                                                4 (4%)           D14S50 (14)                  (TCC) 7,                         1 (3%)  1 (4%)                                     0   0      2 (2%)                  (TC) 9                  (AC) 6,G                  (CA) 12.5           AR (X)*                  (CAG) 19                         6 (18%) 1 (4%)                                     4 (40%)                                         1 (3%)                                                2 (2%)                  (CAA)           MD (19)*                  (CTG)  0       1 (4%)                                     --  2 (6%)                                                3 (3%)           SAT (6)*                  (CAG)  4 (11%) 1 (4%)                                     --  1 (3%)                                                6 (6%)           DRPLA (12)*                  (CAG)  1 (3%)  0   --  0      1 (1%)    Tetranucleotide           ACTBP2 (6)                  (AAAG) 11,                         1 (3%)  1 (4%)                                     2 (20%)                                         3 (9%)                                                7 (7%)                  (AAAG) 15           FGA (4)                  (TCTT) 13                         2 (6%)  1 (4%)                                     0   3 (9%)                                                6 (6%)           UT762 (21)                  (AAAG)                         0       1 (4%)                                     4 (40%)                                         5 (15%)                                               10 (10%)                  Total # of                        10 (29%) 2 (9%)                                     5 (50%)                                         9 (28%)                                               26 (26%)                  tumors with                  alterations in                   locus    __________________________________________________________________________     Microsatellite alterations for each marker tested a number of alterations     detected in each tumor type.     SCC: head and neck squamous cell carcinoma.     NSCLC: nonsmall cell lung carcinoma.     SCLC: small cell lung carcinoma.     TCC: transitional cell carcinoma of the bladder.     26 of 100 (26%) total tumors displayed alterations in at least one locus.     *Disease related.

EXAMPLE 3 DETECTION OF CLONAL POPULATIONS OF TUMOR-DERIVED CELLS

These clonal microsatellite alterations were studied to see if theycould be detected in cytologic samples as tumor-specific markers. Todemonstrate this potential clinical application, several correspondingcytological samples that were considered negative for the presence ofcancer cells by light microscopy were analyzed. When DNA from bladdertumor B27 was screened with the tetranucleotide marker FGA, a novel bandwas identified in the tumor lane as compared to normal (FIG. 2A). FIG. 2shows detection of clonal microsatellite alterations in clinicalsamples. PCR conditions and gel analysis have been described (P. van derRiet, et al., supra). Novel bands are indicated by arrows. Panel A), Thecorresponding cytologic urine sample of B27 (TCC) was analyzed usingmarker FGA and compared with normal and tumor DNA. A novel or "shifted"band is seen in the tumor lane vs. the normal lane and at a lesser, butsignificant intensity in the urine lane. Panel (B), Lymphocytic DNA wasnot available from this patient, but a light band is clearly identifiedin the "negative" histological margin of L31 (NSCLC) which correspondsto the more intense novel band in the tumor lane when screened withmarker AR. Panel (C), Amplification of marker CHRNB; the correspondingsputum sample of 15 (SCLC) revealed two light bands consistent with thenovel bands in the tumor lane. These bands are not present in the normalDNA. (N: normal DNA, U: urine DNA, T: tumor DNA, M: margin DNA, S:sputum DNA).

DNA obtained from the patient's urine sample before surgery was screenedwith the same marker, revealing the same novel band at a lower intensityin the urine DNA (D. Sidransky, et al., supra). The intensity of theshifted band was approximately 5% of the intensity of the correspondingband in tumor DNA, indicating that only a small population of cells inthe urine sample was tumor-derived. One patient with SCLC was found tohave a novel CHRNB allele in the tumor DNA when compared to normal DNA.When DNA from a corresponding, prospectively collected sputum sample wasscreened with the marker CHRNB, the identical genetic change in thesputum DNA initially found in the patient's primary SCLC (FIG. 2c).Again, the lower intensity of the novel bands in sputum suggested thatonly a small fraction of cancer cells was present in the sample. Severalhistopathologically negative surgical margins were examined (D.Sidransky, et al., supra). The tumor DNA of L29 demonstrates a novel,smaller band at the microsatellite marker IFN, while paired DNA of ahistologically negative surgical margin presented the same shifted bandat lower intensity (FIG. 2B). This is consistent with the previousobservation that undetected infiltrating tumor cells can be identifiedin surgical margins after "complete" surgical resection with sensitivemolecular techniques.

These examples demonstrate the ability to detect clonal populations oftumor-derived cells in cytologic samples and histopathologic tissue.This assay was readily reproducible using other altered markers to testcorresponding samples in these cases and paired samples from otherpatients. Clinical samples from patients without a microsatellitealteration detected in the primary tumor were consistently negative.

EXAMPLE 4 SENSITIVITY DETERMINATION BY DILUTION

An obvious problem in screening clinical samples is the need to detectan extremely small number of cancer cells among a large background ofnormal cells, especially in bodily fluids such as urine and sputum. Todemonstrate the sensitivity of the method of the invention, the DNA ofbladder tumors B17, which has a larger novel allele, and B27, which hasa smaller novel allele were utilized. These two samples were chosenbecause of our observation that smaller alleles tend to amplify betterthan larger alleles by PCR. Tumor DNA was diluted with normal,lymphocytic DNA from the same patient and 50 ng of DNA from eachdilution was amplified by PCR.

FIG. 3 shows sensitivity determination by simple dilution. DNA fromtumors (T) containing alterations were diluted with the correspondingpatients' lymphocyte DNA (N) from 1 in 5 (20%) to 1 in 1000 (0.1%).Samples were then amplified by PCR with marker FGA, separated bydenaturing gel electrophoresis and visualized by autoradiography asdescribed (P. van der Riet, et al., supra). Novel bands are indicated byan arrow. Panel (A) shows the novel band seen in the tumor lane of B17(TCC) is still visible when diluted with its normal corresponding DNA to0.1%. Similarly in Panel (B), the novel band in the tumor DNA of B27(TCC) is clearly seen when diluted to 0.5%. The "shifted" band was seenin the dilution mixture containing only 0.1% of tumor DNA in B17 and0.5% tumor DNA in B27. These results suggest that this method maypotentially detect one cancer cell among 200 to 1000 normal cells, thusproving its potential usefulness as a clinical screening assay.

EXAMPLE 5 MULTIPLEX PCR ASSAY FOR DETECTION OF HYPERMUTABLE NUCLEIC ACID

Microsatellite alterations in HNPCC and SCLC could be detected by usingjust a few microsatellite repeat markers since these tumors alter a highpercentage of all tested alleles. For non-HNPCC tumors, a singlewell-selected tri- or tetranucleotide repeat marker could potentiallyidentify over 15% of tumors for a particular cancer type. In SCC andTCC, seven selected markers detected over 28% of tumors (TABLE 1).Because the genetic mechanism underlying these occasional alterations isstill not known, it is possible that some cancers will not display anyalterations regardless of how many markers are tested. Nevertheless, thehuman genome contains over 100,000 repeat regions and it is very likelythat other candidate markers could be identified for this analysis. Acombination of repeat markers could be utilized to identify a highpercentage of cancer patients; the ability to potentially test severalmarkers in a single PCR reaction could simplify the ultimate screeningapproach.

DNA from B27 (TCC) and a corresponding urine sample was amplified withthree different primers sets (FGA on chromosome 4, ACTBP2 on chromosome6, and AR on the X chromosome) in the same PCR reaction, separated ondenaturing acrylamide gels and exposed to film. The concentration ofeach primer was diluted to 100 ng/μg in the final PCR reaction. A novelband in the tumor lane is identical to the corresponding, less intenseband in the urine DNA lane. (N: normal DNA, T: tumor DNA, U: urine DNA).

The feasibility of this approach is shown in FIG. 4, where the threemarkers are multiplexed in a single PCR reaction. The results clearlydemonstrate a novel band in urine DNA identical to the altered allele inthe corresponding primary TCC tumor.

This assay is much simpler to perform that previous PCR-based assaysfollowed by cloning and oligomer-specific hybridization to detectoncogene mutations (D. Sidransky, et al., supra). Although thesensitivity of this assay is slightly diminished, with cancer celldetection limited to a background of approximately 500 normal cells ascompared to 10,000 normal cells using the previous approach, evidencefrom prior studies suggests that this is sufficient to detect cancercells in most clinical specimens including sputum (FIG. 3). Moreover,rare oligoclonal events perhaps secondary to inflammation or hyperplasiawould not be detected because of their dilution among an excessbackground of normal cells devoid of these alterations. Althoughoncogene mutations that provide neoplastic cells with a distinct growthadvantage are not specifically detected, monoclonality is a fundamentalcharacteristic of all neoplasms and detection of clonal cell populationsin cytologic samples remains an ominous sign (P. J. Fialkow, Biochem.Biophys. Acta., 458:283, 1976; P. C. Nowell, Science, 94:23, 1976). Theaccumulation of genetic events in subsequent daughter cells is wellrecognized and a detectable clone would be expected to persist andprobably continue along the neoplastic progression pathway (E. R.Fearon, et al, Cell, 61:709, 1990; D. Sidransky, et al., Nature,355:846, 1992; D. Sidransky, et al., N. Engl. J. Med., 326:737, 1992).

Although many of the genetic events in colorectal cancer progression areknown (E. R. Fearon, et al., supra), few events have beenwell-characterized in most other tumor types and not all occur in agiven tumor. The ability to detect early clonal cell populations inpatients without precise knowledge of the specific gene mutations in theprimary tumor is a major strength of this assay. Indeed, these areprecisely the patients that may be ideal candidates for chemopreventivestrategies and/or amenable to surgical resection with careful follow-up.Moreover, detection of rare infiltrating cancer cells in histopathologicmargins may have great impact on current surgical practice. Because theprimary tumor is already resected in these cases, rapid screening oftumor DNA can provide a single marker for detection of theseinfiltrating tumor cells.

The present invention indicates that microsatellite alterations appearto be a common feature in human cancers and that larger repeats areprobably more prone to this type of genetic instability. The highfrequency of alterations observed in several markers appear to betumor-type specific; the selection of appropriate markers with arelatively high rate of instability for a given tumor type may allow theuse of a multiplex PCR test to identify a high percentage of neoplasmsin patients. The identification of these alterations in bodily fluidsand surgical margins attests to their potential use as clonal markers inthe detection of neoplastic cells. A simple and powerful screening testapplicable to a variety of cancers and pathologic samples isdemonstrated by the present invention. Because cancer is so prevalent inthe population, this molecular approach has important implications forcancer detection.

The invention now being fully described, it will be apparent to one ofordinary skill in the art that many changes and modifications can bemade without departing from the spirit or scope of the invention.

    __________________________________________________________________________    #             SEQUENCE LISTING    - (1) GENERAL INFORMATION:    -    (iii) NUMBER OF SEQUENCES: 40    - (2) INFORMATION FOR SEQ ID NO:1:    -      (i) SEQUENCE CHARACTERISTICS:    #pairs    (A) LENGTH: 18 base              (B) TYPE: nucleic acid              (C) STRANDEDNESS: single              (D) TOPOLOGY: linear    -     (ii) MOLECULE TYPE: cDNA    -    (iii) HYPOTHETICAL: NO    -     (iv) ANTI-SENSE: NO    -      (v) FRAGMENT TYPE:    -     (vi) ORIGINAL SOURCE:    -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:    #  18              TCTG    - (2) INFORMATION FOR SEQ ID NO:2:    -      (i) SEQUENCE CHARACTERISTICS:    #pairs    (A) LENGTH: 19 base              (B) TYPE: nucleic acid              (C) STRANDEDNESS: single              (D) TOPOLOGY: linear    -     (ii) MOLECULE TYPE: cDNA    -    (iii) HYPOTHETICAL: NO    -     (iv) ANTI-SENSE: NO    -      (v) FRAGMENT TYPE:    -     (vi) ORIGINAL SOURCE:    -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:    # 19               CAGTA    - (2) INFORMATION FOR SEQ ID NO:3:    -      (i) SEQUENCE CHARACTERISTICS:    #pairs    (A) LENGTH: 21 base              (B) TYPE: nucleic acid              (C) STRANDEDNESS: single              (D) TOPOLOGY: linear    -     (ii) MOLECULE TYPE: cDNA    -    (iii) HYPOTHETICAL: NO    -     (iv) ANTI-SENSE: NO    -      (v) FRAGMENT TYPE:    -     (vi) ORIGINAL SOURCE:    -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:    #21                TACTGA C    - (2) INFORMATION FOR SEQ ID NO:4:    -      (i) SEQUENCE CHARACTERISTICS:    #pairs    (A) LENGTH: 21 base              (B) TYPE: nucleic acid              (C) STRANDEDNESS: single              (D) TOPOLOGY: linear    -     (ii) MOLECULE TYPE: cDNA    -    (iii) HYPOTHETICAL: NO    -     (iv) ANTI-SENSE: NO    -      (v) FRAGMENT TYPE:    -     (vi) ORIGINAL SOURCE:    -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:    #21                CCTCAA C    - (2) INFORMATION FOR SEQ ID NO:5:    -      (i) SEQUENCE CHARACTERISTICS:    #pairs    (A) LENGTH: 26 base              (B) TYPE: nucleic acid              (C) STRANDEDNESS: single              (D) TOPOLOGY: linear    -     (ii) MOLECULE TYPE: cDNA    -    (iii) HYPOTHETICAL: NO    -     (iv) ANTI-SENSE: NO    -      (v) FRAGMENT TYPE:    -     (vi) ORIGINAL SOURCE:    -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:    #              26  AGGCCT GGGAAG    - (2) INFORMATION FOR SEQ ID NO:6:    -      (i) SEQUENCE CHARACTERISTICS:    #pairs    (A) LENGTH: 27 base              (B) TYPE: nucleic acid              (C) STRANDEDNESS: single              (D) TOPOLOGY: linear    -     (ii) MOLECULE TYPE: cDNA    -    (iii) HYPOTHETICAL: NO    -     (iv) ANTI-SENSE: NO    -      (v) FRAGMENT TYPE:    -     (vi) ORIGINAL SOURCE:    -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:    #             27   TCGAGC CCCGTTC    - (2) INFORMATION FOR SEQ ID NO:7:    -      (i) SEQUENCE CHARACTERISTICS:    #pairs    (A) LENGTH: 24 base              (B) TYPE: nucleic acid              (C) STRANDEDNESS: single              (D) TOPOLOGY: linear    -     (ii) MOLECULE TYPE: cDNA    -    (iii) HYPOTHETICAL: NO    -     (iv) ANTI-SENSE: NO    -      (v) FRAGMENT TYPE:    -     (vi) ORIGINAL SOURCE:    -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:    #                24GAGACT GGTG    - (2) INFORMATION FOR SEQ ID NO:8:    -      (i) SEQUENCE CHARACTERISTICS:    #pairs    (A) LENGTH: 24 base              (B) TYPE: nucleic acid              (C) STRANDEDNESS: single              (D) TOPOLOGY: linear    -     (ii) MOLECULE TYPE: cDNA    -    (iii) HYPOTHETICAL: NO    -     (iv) ANTI-SENSE: NO    -      (v) FRAGMENT TYPE:    -     (vi) ORIGINAL SOURCE:    -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:    #                24CCACTG GAGG    - (2) INFORMATION FOR SEQ ID NO:9:    -      (i) SEQUENCE CHARACTERISTICS:    #pairs    (A) LENGTH: 20 base              (B) TYPE: nucleic acid              (C) STRANDEDNESS: single              (D) TOPOLOGY: linear    -     (ii) MOLECULE TYPE: cDNA    -    (iii) HYPOTHETICAL: NO    -     (iv) ANTI-SENSE: NO    -      (v) FRAGMENT TYPE:    -     (vi) ORIGINAL SOURCE:    -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:    # 20               CGCGGA    - (2) INFORMATION FOR SEQ ID NO:10:    -      (i) SEQUENCE CHARACTERISTICS:    #pairs    (A) LENGTH: 20 base              (B) TYPE: nucleic acid              (C) STRANDEDNESS: single              (D) TOPOLOGY: linear    -     (ii) MOLECULE TYPE: cDNA    -    (iii) HYPOTHETICAL: NO    -     (iv) ANTI-SENSE: NO    -      (v) FRAGMENT TYPE:    -     (vi) ORIGINAL SOURCE:    -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:    # 20               CCCAAG    - (2) INFORMATION FOR SEQ ID NO:11:    -      (i) SEQUENCE CHARACTERISTICS:    #pairs    (A) LENGTH: 20 base              (B) TYPE: nucleic acid              (C) STRANDEDNESS: single              (D) TOPOLOGY: linear    -     (ii) MOLECULE TYPE: cDNA    -    (iii) HYPOTHETICAL: NO    -     (iv) ANTI-SENSE: NO    -      (v) FRAGMENT TYPE:    -     (vi) ORIGINAL SOURCE:    -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:    # 20               GGTGTT    - (2) INFORMATION FOR SEQ ID NO:12:    -      (i) SEQUENCE CHARACTERISTICS:    #pairs    (A) LENGTH: 20 base              (B) TYPE: nucleic acid              (C) STRANDEDNESS: single              (D) TOPOLOGY: linear    -     (ii) MOLECULE TYPE: cDNA    -    (iii) HYPOTHETICAL: NO    -     (iv) ANTI-SENSE: NO    -      (v) FRAGMENT TYPE:    -     (vi) ORIGINAL SOURCE:    -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:    # 20               AATCAT    - (2) INFORMATION FOR SEQ ID NO:13:    -      (i) SEQUENCE CHARACTERISTICS:    #pairs    (A) LENGTH: 21 base              (B) TYPE: nucleic acid              (C) STRANDEDNESS: single              (D) TOPOLOGY: linear    -     (ii) MOLECULE TYPE: cDNA    -    (iii) HYPOTHETICAL: NO    -     (iv) ANTI-SENSE: NO    -      (v) FRAGMENT TYPE:    -     (vi) ORIGINAL SOURCE:    -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:    #21                CCTATG G    - (2) INFORMATION FOR SEQ ID NO:14:    -      (i) SEQUENCE CHARACTERISTICS:    #pairs    (A) LENGTH: 20 base              (B) TYPE: nucleic acid              (C) STRANDEDNESS: single              (D) TOPOLOGY: linear    -     (ii) MOLECULE TYPE: cDNA    -    (iii) HYPOTHETICAL: NO    -     (iv) ANTI-SENSE: NO    -      (v) FRAGMENT TYPE:    -     (vi) ORIGINAL SOURCE:    -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:14:    # 20               GAGAAG    - (2) INFORMATION FOR SEQ ID NO:15:    -      (i) SEQUENCE CHARACTERISTICS:    #pairs    (A) LENGTH: 24 base              (B) TYPE: nucleic acid              (C) STRANDEDNESS: single              (D) TOPOLOGY: linear    -     (ii) MOLECULE TYPE: cDNA    -    (iii) HYPOTHETICAL: NO    -     (iv) ANTI-SENSE: NO    -      (v) FRAGMENT TYPE:    -     (vi) ORIGINAL SOURCE:    -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:15:    #                24AGATTC TGGA    - (2) INFORMATION FOR SEQ ID NO:16:    -      (i) SEQUENCE CHARACTERISTICS:    #pairs    (A) LENGTH: 24 base              (B) TYPE: nucleic acid              (C) STRANDEDNESS: single              (D) TOPOLOGY: linear    -     (ii) MOLECULE TYPE: cDNA    -    (iii) HYPOTHETICAL: NO    -     (iv) ANTI-SENSE: NO    -      (v) FRAGMENT TYPE:    -     (vi) ORIGINAL SOURCE:    -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:16:    #                24CCTTCA CAGC    - (2) INFORMATION FOR SEQ ID NO:17:    -      (i) SEQUENCE CHARACTERISTICS:    #pairs    (A) LENGTH: 20 base              (B) TYPE: nucleic acid              (C) STRANDEDNESS: single              (D) TOPOLOGY: linear    -     (ii) MOLECULE TYPE: cDNA    -    (iii) HYPOTHETICAL: NO    -     (iv) ANTI-SENSE: NO    -      (v) FRAGMENT TYPE:    -     (vi) ORIGINAL SOURCE:    -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:17:    # 20               CAGATT    - (2) INFORMATION FOR SEQ ID NO:18:    -      (i) SEQUENCE CHARACTERISTICS:    #pairs    (A) LENGTH: 20 base              (B) TYPE: nucleic acid              (C) STRANDEDNESS: single              (D) TOPOLOGY: linear    -     (ii) MOLECULE TYPE: cDNA    -    (iii) HYPOTHETICAL: NO    -     (iv) ANTI-SENSE: NO    -      (v) FRAGMENT TYPE:    -     (vi) ORIGINAL SOURCE:    -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:18:    # 20               AGATGT    - (2) INFORMATION FOR SEQ ID NO:19:    -      (i) SEQUENCE CHARACTERISTICS:    #pairs    (A) LENGTH: 22 base              (B) TYPE: nucleic acid              (C) STRANDEDNESS: single              (D) TOPOLOGY: linear    -     (ii) MOLECULE TYPE: cDNA    -    (iii) HYPOTHETICAL: NO    -     (iv) ANTI-SENSE: NO    -      (v) FRAGMENT TYPE:    -     (vi) ORIGINAL SOURCE:    -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:19:    #                 22AGACT GA    - (2) INFORMATION FOR SEQ ID NO:20:    -      (i) SEQUENCE CHARACTERISTICS:    #pairs    (A) LENGTH: 20 base              (B) TYPE: nucleic acid              (C) STRANDEDNESS: single              (D) TOPOLOGY: linear    -     (ii) MOLECULE TYPE: cDNA    -    (iii) HYPOTHETICAL: NO    -     (iv) ANTI-SENSE: NO    -      (v) FRAGMENT TYPE:    -     (vi) ORIGINAL SOURCE:    -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:20:    # 20               CTCCTA    - (2) INFORMATION FOR SEQ ID NO:21:    -      (i) SEQUENCE CHARACTERISTICS:    #pairs    (A) LENGTH: 18 base              (B) TYPE: nucleic acid              (C) STRANDEDNESS: single              (D) TOPOLOGY: linear    -     (ii) MOLECULE TYPE: cDNA    -    (iii) HYPOTHETICAL: NO    -     (iv) ANTI-SENSE: NO    -      (v) FRAGMENT TYPE:    -     (vi) ORIGINAL SOURCE:    -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:21:    #  18              CAAG    - (2) INFORMATION FOR SEQ ID NO:22:    -      (i) SEQUENCE CHARACTERISTICS:    #pairs    (A) LENGTH: 19 base              (B) TYPE: nucleic acid              (C) STRANDEDNESS: single              (D) TOPOLOGY: linear    -     (ii) MOLECULE TYPE: cDNA    -    (iii) HYPOTHETICAL: NO    -     (iv) ANTI-SENSE: NO    -      (v) FRAGMENT TYPE:    -     (vi) ORIGINAL SOURCE:    -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:22:    # 19               GGCTG    - (2) INFORMATION FOR SEQ ID NO:23:    -      (i) SEQUENCE CHARACTERISTICS:    #pairs    (A) LENGTH: 21 base              (B) TYPE: nucleic acid              (C) STRANDEDNESS: single              (D) TOPOLOGY: linear    -     (ii) MOLECULE TYPE: cDNA    -    (iii) HYPOTHETICAL: NO    -     (iv) ANTI-SENSE: NO    -      (v) FRAGMENT TYPE:    -     (vi) ORIGINAL SOURCE:    -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:23:    #21                GTTACC A    - (2) INFORMATION FOR SEQ ID NO:24:    -      (i) SEQUENCE CHARACTERISTICS:    #pairs    (A) LENGTH: 21 base              (B) TYPE: nucleic acid              (C) STRANDEDNESS: single              (D) TOPOLOGY: linear    -     (ii) MOLECULE TYPE: cDNA    -    (iii) HYPOTHETICAL: NO    -     (iv) ANTI-SENSE: NO    -      (v) FRAGMENT TYPE:    -     (vi) ORIGINAL SOURCE:    -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:24:    #21                CATCAG T    - (2) INFORMATION FOR SEQ ID NO:25:    -      (i) SEQUENCE CHARACTERISTICS:    #pairs    (A) LENGTH: 26 base              (B) TYPE: nucleic acid              (C) STRANDEDNESS: single              (D) TOPOLOGY: linear    -     (ii) MOLECULE TYPE: cDNA    -    (iii) HYPOTHETICAL: NO    -     (iv) ANTI-SENSE: NO    -      (v) FRAGMENT TYPE:    -     (vi) ORIGINAL SOURCE:    -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:25:    #              26  AGTTTG CCCATC    - (2) INFORMATION FOR SEQ ID NO:26:    -      (i) SEQUENCE CHARACTERISTICS:    #pairs    (A) LENGTH: 27 base              (B) TYPE: nucleic acid              (C) STRANDEDNESS: single              (D) TOPOLOGY: linear    -     (ii) MOLECULE TYPE: cDNA    -    (iii) HYPOTHETICAL: NO    -     (iv) ANTI-SENSE: NO    -      (v) FRAGMENT TYPE:    -     (vi) ORIGINAL SOURCE:    -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:26:    #             27   GGGTCC TTGTAGC    - (2) INFORMATION FOR SEQ ID NO:27:    -      (i) SEQUENCE CHARACTERISTICS:    #pairs    (A) LENGTH: 24 base              (B) TYPE: nucleic acid              (C) STRANDEDNESS: single              (D) TOPOLOGY: linear    -     (ii) MOLECULE TYPE: cDNA    -    (iii) HYPOTHETICAL: NO    -     (iv) ANTI-SENSE: NO    -      (v) FRAGMENT TYPE:    -     (vi) ORIGINAL SOURCE:    -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:27:    #                24CATCAC CATC    - (2) INFORMATION FOR SEQ ID NO:28:    -      (i) SEQUENCE CHARACTERISTICS:    #pairs    (A) LENGTH: 24 base              (B) TYPE: nucleic acid              (C) STRANDEDNESS: single              (D) TOPOLOGY: linear    -     (ii) MOLECULE TYPE: cDNA    -    (iii) HYPOTHETICAL: NO    -     (iv) ANTI-SENSE: NO    -      (v) FRAGMENT TYPE:    -     (vi) ORIGINAL SOURCE:    -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:28:    #                24GGAAAT GCTC    - (2) INFORMATION FOR SEQ ID NO:29:    -      (i) SEQUENCE CHARACTERISTICS:    #pairs    (A) LENGTH: 20 base              (B) TYPE: nucleic acid              (C) STRANDEDNESS: single              (D) TOPOLOGY: linear    -     (ii) MOLECULE TYPE: cDNA    -    (iii) HYPOTHETICAL: NO    -     (iv) ANTI-SENSE: NO    -      (v) FRAGMENT TYPE:    -     (vi) ORIGINAL SOURCE:    -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:29:    # 20               CAGAAC    - (2) INFORMATION FOR SEQ ID NO:30:    -      (i) SEQUENCE CHARACTERISTICS:    #pairs    (A) LENGTH: 20 base              (B) TYPE: nucleic acid              (C) STRANDEDNESS: single              (D) TOPOLOGY: linear    -     (ii) MOLECULE TYPE: cDNA    -    (iii) HYPOTHETICAL: NO    -     (iv) ANTI-SENSE: NO    -      (v) FRAGMENT TYPE:    -     (vi) ORIGINAL SOURCE:    -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:30:    # 20               TCCTCA    - (2) INFORMATION FOR SEQ ID NO:31:    -      (i) SEQUENCE CHARACTERISTICS:    #pairs    (A) LENGTH: 20 base              (B) TYPE: nucleic acid              (C) STRANDEDNESS: single              (D) TOPOLOGY: linear    -     (ii) MOLECULE TYPE: cDNA    -    (iii) HYPOTHETICAL: NO    -     (iv) ANTI-SENSE: NO    -      (v) FRAGMENT TYPE:    -     (vi) ORIGINAL SOURCE:    -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:31:    # 20               ACCACT    - (2) INFORMATION FOR SEQ ID NO:32:    -      (i) SEQUENCE CHARACTERISTICS:    #pairs    (A) LENGTH: 20 base              (B) TYPE: nucleic acid              (C) STRANDEDNESS: single              (D) TOPOLOGY: linear    -     (ii) MOLECULE TYPE: cDNA    -    (iii) HYPOTHETICAL: NO    -     (iv) ANTI-SENSE: NO    -      (v) FRAGMENT TYPE:    -     (vi) ORIGINAL SOURCE:    -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:32:    # 20               TGGCAG    - (2) INFORMATION FOR SEQ ID NO:33:    -      (i) SEQUENCE CHARACTERISTICS:    #pairs    (A) LENGTH: 21 base              (B) TYPE: nucleic acid              (C) STRANDEDNESS: single              (D) TOPOLOGY: linear    -     (ii) MOLECULE TYPE: cDNA    -    (iii) HYPOTHETICAL: NO    -     (iv) ANTI-SENSE: NO    -      (v) FRAGMENT TYPE:    -     (vi) ORIGINAL SOURCE:    -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:33:    #21                CTCACA G    - (2) INFORMATION FOR SEQ ID NO:34:    -      (i) SEQUENCE CHARACTERISTICS:    #pairs    (A) LENGTH: 20 base              (B) TYPE: nucleic acid              (C) STRANDEDNESS: single              (D) TOPOLOGY: linear    -     (ii) MOLECULE TYPE: cDNA    -    (iii) HYPOTHETICAL: NO    -     (iv) ANTI-SENSE: NO    -      (v) FRAGMENT TYPE:    -     (vi) ORIGINAL SOURCE:    -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:34:    # 20               TGACAC    - (2) INFORMATION FOR SEQ ID NO:35:    -      (i) SEQUENCE CHARACTERISTICS:    #pairs    (A) LENGTH: 24 base              (B) TYPE: nucleic acid              (C) STRANDEDNESS: single              (D) TOPOLOGY: linear    -     (ii) MOLECULE TYPE: cDNA    -    (iii) HYPOTHETICAL: NO    -     (iv) ANTI-SENSE: NO    -      (v) FRAGMENT TYPE:    -     (vi) ORIGINAL SOURCE:    -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:35:    #                24CAGAGC GTGC    - (2) INFORMATION FOR SEQ ID NO:36:    -      (i) SEQUENCE CHARACTERISTICS:    #pairs    (A) LENGTH: 24 base              (B) TYPE: nucleic acid              (C) STRANDEDNESS: single              (D) TOPOLOGY: linear    -     (ii) MOLECULE TYPE: cDNA    -    (iii) HYPOTHETICAL: NO    -     (iv) ANTI-SENSE: NO    -      (v) FRAGMENT TYPE:    -     (vi) ORIGINAL SOURCE:    -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:36:    #                24TGTTCC TCAT    - (2) INFORMATION FOR SEQ ID NO:37:    -      (i) SEQUENCE CHARACTERISTICS:    #pairs    (A) LENGTH: 20 base              (B) TYPE: nucleic acid              (C) STRANDEDNESS: single              (D) TOPOLOGY: linear    -     (ii) MOLECULE TYPE: cDNA    -    (iii) HYPOTHETICAL: NO    -     (iv) ANTI-SENSE: NO    -      (v) FRAGMENT TYPE:    -     (vi) ORIGINAL SOURCE:    -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:37:    # 20               GAGTGA    - (2) INFORMATION FOR SEQ ID NO:38:    -      (i) SEQUENCE CHARACTERISTICS:    #pairs    (A) LENGTH: 20 base              (B) TYPE: nucleic acid              (C) STRANDEDNESS: single              (D) TOPOLOGY: linear    -     (ii) MOLECULE TYPE: cDNA    -    (iii) HYPOTHETICAL: NO    -     (iv) ANTI-SENSE: NO    -      (v) FRAGMENT TYPE:    -     (vi) ORIGINAL SOURCE:    -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:38:    # 20               GCTATA    - (2) INFORMATION FOR SEQ ID NO:39:    -      (i) SEQUENCE CHARACTERISTICS:    #pairs    (A) LENGTH: 22 base              (B) TYPE: nucleic acid              (C) STRANDEDNESS: single              (D) TOPOLOGY: linear    -     (ii) MOLECULE TYPE: cDNA    -    (iii) HYPOTHETICAL: NO    -     (iv) ANTI-SENSE: NO    -      (v) FRAGMENT TYPE:    -     (vi) ORIGINAL SOURCE:    -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:39:    #                 22CCTTG CA    - (2) INFORMATION FOR SEQ ID NO:40:    -      (i) SEQUENCE CHARACTERISTICS:    #pairs    (A) LENGTH: 20 base              (B) TYPE: nucleic acid              (C) STRANDEDNESS: single              (D) TOPOLOGY: linear    -     (ii) MOLECULE TYPE: cDNA    -    (iii) HYPOTHETICAL: NO    -     (iv) ANTI-SENSE: NO    -      (v) FRAGMENT TYPE:    -     (vi) ORIGINAL SOURCE:    -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:40:    # 20               CCTGTT    __________________________________________________________________________

I claim:
 1. A kit useful for the detection of a cell proliferativedisorder of a tissue selected from the group consisting of head, neck,lung, and bladder wherein the detection comprises testing of a pluralityof microsatellite repeat target nucleic acid markers from a specimen,wherein the specimen is external to a primary neoplasm and contains aneoplastic nucleic acid present in the primary neoplasm, the kitcomprising a carrier means and one or more containers containing aplurality of pairs of oligonucleotide primers each of said primerscomprising at least 15 nucleotides which hybridize to flanking nucleicacid sequences of a microsatellite repeat target nucleic acid marker,wherein the microsatellite repeat target nucleic acid markers areselected from the group consisting of ARA, D14S50, AR, MD, SAT, DRPLA,ACTBP2, FGA, and UT762.
 2. The kit of claim 1, wherein the kit furthercomprises a detectably labeled deoxynucleotide.
 3. The kit of claim 1,flanking sequences are selected from the group consisting of SEQ ID NO:ID Nos: 1-40.
 4. The kit of claim 1, wherein the kit comprises at leastthree pairs of oligonucleotide primers each of said pairs hybridizing toflanking nucleic acids of a distinct microsatellite repeat.
 5. The kitof claim 1, wherein the kit comprises at least seven pairs ofoligonucleotide primers each of said pairs hybridizing to flankingnucleic acids of a distinct microsatellite repeat.
 6. The kit of claim1, wherein the kit comprises at least ten pairs of oligonucleotideprimers each of said pairs hybridizing to flanking nucleic acids of adistinct microsatellite repeat.
 7. A method for detecting bladder cancerin a subject comprising the steps of:comparing (a) the lengths of aplurality of microsatellite markers from a nucleic acid sample isolatedfrom a urine specimen of the subject with (b) the lengths of saidplurality of microsatellite markers from a control specimen of saidsubject, wherein said microsatellite markers comprise a repeat sequenceselected from the group consisting of AGC, CAG, CAA, CTG, AAAG, andTCTT; identifying the presence of bladder cancer if a variation inlength is found between a marker from the control specimen and the urinespecimen.
 8. A method for detecting bladder cancer in a subjectcomprising the steps of:comparing (a) the lengths of a plurality ofmicrosatellite markers from a nucleic acid sample isolated from a urinespecimen of the subject with (b) the lengths of said plurality ofmicrosatellite markers from a control specimen of said subject, whereinsaid microsatellite markers are selected from the group consisting of:ARA, AR, MD, SAT, ACTBB2, FGA, and UT762; identifying the presence ofbladder cancer if a variation in length is found between a marker fromthe control specimen and the urine specimen.
 9. A method for detectingbladder cancer in a subject comprising the steps of:comparing (a) thelengths of a plurality of microsatellite markers from a nucleic acidsample isolated from a bladder specimen of the subject with (b) thelengths of said plurality of microsatellite markers from a controlspecimen of said subject, wherein said microsatellite markers comprise arepeat sequence selected from the group consisting of: AGC, CAG, CAA,CTG, AAAG, and TCTT; identifying the presence of bladder cancer if avariation in length is found between a marker from the control specimenand the bladder specimen.
 10. A method for detecting bladder cancer in asubject comprising the steps of:comparing (a) the lengths of a pluralityof microsatellite markers from a nucleic acid sample isolated from abladder specimen of the subject with (b) the lengths of said pluralityof microsatellite markers from a control specimen of said subject,wherein said microsatellite markers are selected from the groupconsisting of: ARA, AR, MD, SAT, ACTBB2, FGA, and UT762; identifying thepresence of bladder cancer if a variation in length is found between amarker from the control specimen and the bladder specimen.
 11. A methodfor detecting Small Cell Lung Cancer (SCLC) in a subject comprising thesteps of:comparing (a) the lengths of a plurality of microsatellitemarkers from a nucleic acid sample isolated from a sputum specimen ofthe subject with (b) the lengths of said plurality of microsatellitemarkers from a control specimen of said subject, wherein saidmicrosatellite markers comprise a repeat sequence selected from thegroup consisting of: CAG, CAA, and AAAG; identifying the presence ofSCLC if a variation in length is found between a marker from the controlspecimen and the sputum specimen.
 12. A method for detecting SCLC cancerin a subject comprising the steps of:comparing (a) the lengths of aplurality of microsatellite markers from a nucleic acid sample isolatedfrom a sputum specimen of the subject with (b) the lengths of saidplurality of microsatellite markers from a control specimen of saidsubject, wherein said microsatellite markers are selected from the groupconsisting of: AR, ACTBB2, and UT762; identifying the presence of SCLCif a variation in length is found between a marker from the controlspecimen and the sputum specimen.
 13. A method for detecting Non-smallCell Lung Cancer (NSCLC) in a subject comprising the steps of:comparing(a) the lengths of a plurality of microsatellite markers from a nucleicacid sample isolated from a sputum specimen of the subject with (b) thelengths of said plurality of microsatellite markers from a controlspecimen of said subject, wherein said microsatellite markers comprise arepeat sequence selected from the group consisting of: AGC, TCC, TC, AC,CA, CAG, CAA, CTG, CAG, AAAG, and TCTT; identifying the presence ofNSCLC if a variation in length is found between a marker from thecontrol specimen and the sputum specimen.
 14. A method for detectingNSCLC cancer in a subject comprising the steps of:comparing (a) thelengths of a plurality of microsatellite markers from a nucleic acidsample isolated from a sputum specimen of the subject with (b) thelengths of said plurality of microsatellite markers from a controlspecimen of said subject, wherein said microsatellite markers areselected from the group consisting of: ARA, D14S50, AR, MD, SAT, ACTBB2,FGA, and UT762; identifying the presence of NSCLC if a variation inlength is found between a marker from the control specimen and thesputum specimen.
 15. A method for detecting Small Cell Lung Cancer(SCLC) in a subject comprising the steps of:comparing (a) the lengths ofa plurality of microsatellite markers from a nucleic acid sampleisolated from a lung specimen of the subject with (b) the lengths ofsaid plurality of microsatellite markers from a control specimen of saidsubject, wherein said microsatellite markers comprise a repeat sequenceselected from the group consisting of: CAG, CAA, and AAAG; identifyingthe presence of SCLC if a variation in length is found between a markerfrom the control specimen and the lung specimen.
 16. A method fordetecting SCLC cancer in a subject comprising the steps of:comparing (a)the lengths of a plurality of microsatellite markers from a nucleic acidsample isolated from a lung specimen of the subject with (b) the lengthsof said plurality of microsatellite markers from a control specimen ofsaid subject, wherein said microsatellite markers are selected from thegroup consisting of: AR, ACTBB2, and UT762; identifying the presence ofSCLC if a variation in length is found between a marker from the controlspecimen and the lung specimen.
 17. A method for detecting Non-smallCell Lung Cancer (NSCLC) in a subject comprising the steps of:comparing(a) the lengths of a plurality of microsatellite markers from a nucleicacid sample isolated from a lung specimen of the subject with (b) thelengths of said plurality of microsatellite markers from a controlspecimen of said subject, wherein said microsatellite markers comprise arepeat sequence selected from the group consisting of: AGC, TCC, TC, AC,CA, CAG, CAA, CTG, CAG, AAAG, and TCTT; identifying the presence ofNSCLC if a variation in length is found between a marker from thecontrol specimen and the lung specimen.
 18. A method for detecting NSCLCin a subject comprising the steps of:comparing (a) the lengths of aplurality of microsatellite markers from a nucleic acid sample isolatedfrom a lung specimen of the subject with (b) the lengths of saidplurality of microsatellite markers from a control specimen of saidsubject, wherein said microsatellite markers are selected from the groupconsisting of: ARA, D14S50, AR, MD, SAT, ACTBB2, FGA, and UT762;identifying the presence of NSCLC if a variation in length is foundbetween a marker from the control specimen and the lung specimen.
 19. Amethod for detecting head and neck small cell carcinoma (SCC) in asubject comprising the steps of:comparing (a) the lengths of a pluralityof microsatellite markers from a nucleic acid sample isolated from ahead or neck specimen of the subject with (b) the lengths of saidplurality of microsatellite markers from a control specimen of saidsubject, wherein said microsatellite markers comprise a repeat sequenceselected from the group consisting of: AGC, TCC, TC, CA, AC, CAG, CAA,AAAG, and TCTT; identifying the presence of head and neck small cellcarcinoma if a variation in length is found between a marker from thecontrol specimen and the head or neck specimen.
 20. A method fordetecting head and neck small cell carcinoma (SCC) in a subjectcomprising the steps of:comparing (a) the lengths of a plurality ofmicrosatellite markers from a nucleic acid sample isolated from a heador neck specimen of the subject with (b) the lengths of said pluralityof microsatellite markers from a control specimen of said subject,wherein said microsatellite markers are selected from the groupconsisting of: ARA, D14S50, AR, SAT, DRPLA, ACTBB2, and FGA; identifyingthe presence of head and neck small cell carcinoma if a variation inlength is found between a marker from the control specimen and the heador neck specimen.
 21. The method of any of claims 7-20 wherein at least3 markers are compared.
 22. The method of any of claims 7-20 wherein atleast 7 markers are compared.
 23. The method of any of claims 7-20wherein at least 10 markers are compared.